Justin Cui

LoL: Longer than Longer, Scaling Video Generation to Hour

Jan 23, 2026
Via arXiv

Reward-Forcing: Autoregressive Video Generation with Reward Feedback

Jan 23, 2026
Via arXiv

Can Agent Conquer Web? Exploring the Frontiers of ChatGPT Atlas Agent in Web Games

Oct 30, 2025
Via arXiv

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Oct 02, 2025
Via arXiv

Concepts or Skills? Rethinking Instruction Selection for Multi-modal Models

Aug 14, 2025
Via arXiv

DD-Ranking: Rethinking the Evaluation of Dataset Distillation

May 19, 2025
Via arXiv

Latent Video Dataset Distillation

Apr 23, 2025
Via arXiv

Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability

Apr 09, 2025
Via arXiv

Ameliorate Spurious Correlations in Dataset Condensation

Jun 06, 2024
Via arXiv

OR-Bench: An Over-Refusal Benchmark for Large Language Models

May 31, 2024
Via arXiv